How Many Templates for GW Chirp Detection? The Minimal-Match Issue Revisited 



R. P. Croce, Th. Demma, V. Pierro, and I. M. Pinto 

Wavesgroup, University of Sannio at Benevento, Italy 



M. Longo, S. Marano and V. Matta 
D.I. 3 E., University of Salerno, Italy 

In a recent paper dealing with maximum likelihood detection of gravitational wave chirps from 
coalescing binaries with unknown parameters we introduced an accurate representation of the no- 
signal cumulative distribution of the supremum of the whole correlator bank. This result can be used 
to derive a refined estimate of the number of templates yielding the best tradeoff between detector's 
performance (in terms of lost signals among those potentially detectable) and computational burden. 
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Optimizing the design of the correlator-bank for detecting gravitational wave (henceforth GW) chirps from coa- 
$— i ' lescing binaries (henceforth CB) with unknown parameters is a key and yet not settled problem. The main goal is to 
Qh, identify and adopt the best tradeoff between detector's performance and computational cost. In a recent paper p| we 
introduced an accurate representation of the cumulative distribution function (henceforth CDF) of the supremum of a 
bank of correlators in the null hypothesis taking into account the nonzero covariance among them, extending previous 
■ results by Mohanty Q- These findings can be used to re-address the following questions: i) whether it really makes 
sense to use the largest number of correlators compatible with the available computing power, and ii) whether the 
04 i simple argument Q whereby the fraction of potentially observable sources which would be lost as an effect of template 
sparsity is (1 — T 3 ), T being the minimal match Q is still valid when one takes into account both the dependence 
of the detection threshold on the total number of correlators, and the covariance among the correlators, at a given 
false-alarm level. The above issues are discussed in Sect. 3 and 4 below, while Sect. 2 is aimed at introducing the 
, relevant notation. 
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O . II. BACKGROUND 

"o 

The maximum likelihood (henceforth ML) strategy for detecting GW chirps from CBs with unknown parameters 
5^ ' goes through the following steps. First, a set of reduced templates^ is set up, viz.: 

>: T(/,£ T )=T r 7 / 6 expbY,(/,fr)], (2.1) 

•i-H . 

where ^(/, £r) is the (post-newtonian) chirp phasing function £p represents the intrinsic source parameters 
(binary companion masses and spin parameters), and To is a suitable constant. Second, a set of non-coherent reduced 
■ - - ■ correlators 

C[A, T] = sup T f ^ T K A [f, £ T ] (2.2) 

r 

is computed, where T is the Fourier-transform operator and: 

. \ 2 A{fy£MM fe(f n 

K A \JM = \ ff[J) ' /M/*,/ s ) (2.3) 

[ 0, elsewhere. 

In eq. I|2.3(l A(f) — S(f) + N(f) are the (spectral, noisy) data, resulting from the superposition of a (possibly null) 
signal S(f) and a realization N(f) of the antenna noise, (/j, f s ) is the antenna spectral window, and n(/) is the 
(one-sided) antenna noise power spectral density (henceforth PSD). The Fourier transform in 12.2fl is implemented 
using a discrete finite algorithm (DFT). The corresponding number of coalescency times will be henceforth denoted 
by A^e; each reduced template is thus used to compute Nq (non-reduced) correlators. 
Once all correlators have been evaluated, the final step consists in comparing the supremum 



A = supC[A,T] 



(2.4) 
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(or any nondecreasing function thereof) to a threshold 7 depending on the prescribed false-alarm probability 

a = prob[A > 7|no signal]. (2-5) 

The threshold 7 can be accordingly computed from the no-signal cumulative distribution of A. Whenever A > 7 
one declares that a signal has been observed (within false alarm probability a), and estimates the unknown source 
parameters from those of the template yielding the largest correlator. 
On the other hand, it can be shown that the false-dismissal probability 



j3 — prob[A < 7 j nonzero signal] 



is a decreasing function of the match 

M[S,T] = 
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between the signal S and the (most similar available) template T, where: 

/- 7 / 3 exp [j [^(/, &) - V(/, &)] } 



Ks[f,tT] = 



0. 



n(/) 

elsewhere. 
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The match is a measure of the similarity between S and T, and is obviously equal to one iff S = T. The reduced- 
template set should be accordingly designed so as to cope with the more or less obvious prescription 



VS G S,3T e C : M[S,T] > V, 
where S and C are the allowed signal and template sets, respectively, and T is the minimal match 



(2.9) 



III. MINIMAL MATCH VS. NUMBER OF TEMPLATES 



The natural question is how many reduced templates one should use in practice. The standard argument suggests 
using the largest total number Na of (reduced) templates in the allowed (reduced) parameter-space search subset, 
compatible with the available computing power, so as to achieve the largest-minimal-match T over the class of admis- 
sible sources. 

In this connection, it should be stressed first that a visible knee point in the T vs. Na curve invariably exists at some 
value Na = N*, beyond which any further increase in Na yields only a modest increase in F. 

This is illustrated in Fig. 1 for the simplest case of the one-parameter family of (reduced) newtonian waveforms in the 
range of chirp masses (0.2Af Q < M. < 10M Q ). To compute the curve in Fig. 1 we used the (initial) LIGO noise PSD 

1. 

w) = i{ 1 + (f) J + 2 (Sl <"> 

with fo = 300 Hz, fa — 40Hz and f s = 400Hz; we sampled the data at 3.2KHz (4-times the Nyquist rate), and let 
A*e = 2 21 , which corresponds to the longest-lived source allowed (A4 = 0.2M©). 

A knee point separating intervals of different (almost constant) slope in the T versus Na curve occurs in Fig. 1 at 
Na ~ 5 ■ 10 3 . Except for the numerical value of AT*, the existence of the above knee point is a generic feature of the 
r vs. Na curve, occurring also for more realistic, higher-order post-newtonian waveforms and templates. 



IV. UNDETECTED OBSERVABLE SOURCES VS. NUMBER OF TEMPLATES 

It was first argued in Q that (under the simplest assumption of isotropically and homogeneously distributed sources) 
the fraction of potentially observable sources which would be undetected as an effect of template mismatch is (1 — T 3 ). 
This argument conceals an obvious naivety insofar as it neglects the dependence of the detection threshold 7 corre- 
sponding to a given false-alarm probability a on the total number of templates N = NqNa, and hence on T itself. 
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In order to gain insight into this issue, it is expedient to work in terms of the detection probability Pd, and capitalize 
on the accurate approximate representations for the no-signal cumulative distribution of the whole-bank suprcmum 
obtained in pj, which is needed to compute the threshold 7 corresponding to a given a. 
Accordingly, let 



P d = 1 - (3 = 1 - Q[ 7 - E(A)} 



(4.1) 



where Q(-) is an unknown CDF, whose density can be nonetheless safely assumed as being unimodal and center- 
symmetric, at least for meaningful values of the signal-to-noise ratio. In the worst case of minimal match, 



E(A) w Td, 

where d is the signal deflection (signal-to-noise ratio), viz.: 

^ S(f)S*(f) 

u n (/) 
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(4.2) 
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and we assume d 3> 1. Let the observable sources be those for which Pd > 0.5. Under the above assumptions for Q(-), 
these correspond to: 



(4.4) 



The deflection d on the other hand can be written 



A K 

d= R 



(4.5) 



where R is the source distance, and if is a constant depending on the antenna and source features, as well as their 
mutual orientation. Equation H4.4JI allows to define the radius R of the antenna-centered sphere which contains all 
sources of a given class (seen by the same antenna, sharing the same features and orientation) producing a deflection 
d > 7 (i.e., observable), viz.: 



R 



KT 
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(4.6) 



Hence the number of observable sources in the given class, under the most obvious hypothesis of uniform (homogeneous 
and isotropic) spatial source density p s is: 



KT 
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(4.7) 



Letting T maxi N max the largest achievable minimal match, and the related number of templates, and denoting as 



N. 



(max) 



the corresponding value of N s , one has from l|4.7|) : 
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N. 
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7 (a, N max ) 
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(4.8) 



where the dependence of 7 from a and N has been written explicitly. The quantity 1 — rj represents the fraction 
of observable sources which would be lost due to poor minimal match (r < T max ). Note that the ratio rj does not 
depend on K. 

Neglecting the dependance of 7 on N (and hence on T), the second factor on the r.h.s. of l|4.8|l cancels, and we are 
left with 



(4.9) 



which does not depend on a. Under the same assumption, N s in H4.7(l increases monotically with T, attaining its 
supremum ]\[( max ) a t T max = 1, yielding r/ = T 3 . This is, in essence, the simplest argument introduced in Q. 
In order to refine this reasoning one should obviously include the dependence of 7 on N, which in turn is affected 
by the covariance among the correlators [1|. For illustrative purposes, we shall again stick at the simplest case of 
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newtonian signals and templates, using the same parameters as in Fig. 1, and use the best available approximant 
for the no-signal CDF of the detection statistic A from 1] in order to compute the ratio ^(a, N max ) /^(a, N) on 
the r.h.s. of Ij4.8|l 9j. In Fig. 2 we accordingly compare 770 (eq. 14.9JI . dashed line), to 77 (eq. I|4.8|) . solid lines), 
for a — 10~ fe , k — 2,3,4 (top-to bottom), as functions of In Fig. 2 we assumed T max = 0.99 as a bona fide 
practical limit value, and computed N max accordingly. From Fig. 2 it is seen that a knee-point in the curves exists at 
TVa ~ N* « 5 • 10 3 . It is also seen that rj exceeds by a non-negligible 5% typ. the plain estimate 770, for Na ~ iV*, if 
the best available approximant for the no-signal CDF of the detection statistic A is used to compute the threshold 10]. 
It is also interesting to compare in terms of 77 alternative (less accurate, i.e., yielding larger thresholds for the same 
a and N) approximants for the no-signal CDF of the detection statistic A. This is obtained by computing ]\[( max ") 
using the best available approximant, i.e., using this latter to compute j(a, N max ) in l|4.8[l . The resulting curves 
are shown in Fig. 3, where again we assumed T max — 0.99 as a bona fide practical limit-value, and computed 
Nmax accordingly. Specifically, the (scaled) quantities j/ ' 1 ' 2 ) in Fig. 3 correspond to ignoring the covariance 
among the correlators (dotted line), and to including the covariance among nearest-neighbour correlators along one 
(coalescency-time, dashed line) or both (coalescency-time and chirp-mass, solid line, best available approximant) 
coordinates of the newtonian parameter-space, respectively For each case, the 77 vs. Na curves corresponding to 
a = 10~ 2 , 10~ 3 , 10~ 4 are displayed. It is seen that the dependence of r\ on a becomes the less relevant, the better 
approximant is used. 



V. CONCLUSIONS 

On the basis of the above findings, the following conclusions can be drawn. Inspection of Fig.s 1 and 2, shows a 
consistent evidence that increasing the number of templates beyond a critical value corresponding to the knee-point 
in the curve of T vs. Na (Fig. 1), does not produce a sensible increase in the detectable fraction of potentially 
observable sources, at the expense of a marked growth of computational load. On the other hand, using the presently 
most accurate available representation of the no-signal cumulative distribution of the (whole-bank) detection statistic 
to compute the threshold results in a sizeable increase (> 5%) in the detectable fraction of potentially observable 
sources over the naive oc T 3 estimate. 

Both findings above support the current trend Q toward the use of hierarchical search strategies for best tradeoff 
between detector's performance and computational burden. 
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Captions to the Figures 



Figure 1 - Minimal match vs. number of reduced templates in 0.2M Q < M. < 1OM . Newtonian waveforms, Ligo-I 
noise. 



Figure 2 - Detectable fraction of potentially observable sources, in 0.2M© < A4 < 10M Q . rjo (eq. (|4.9|l . dashed line) 
and r\ (eq. (|4.8|l . solid lines) vs. for a = 10~ fc , k = 2,3, 4 (top-to bottom). Newtonian waveforms, Ligo-I noise. 



Figure 3 - Detectable fraction of potentially observable sources, with 0.2M Q < .A/f < 10M Q vs. number of reduced 
templates at different false-alarm levels, using different models of the no-signal cumulative distribution of the detection 
statistic. Newtonian waveforms, Ligo-I noise. 



